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Abstract. The debts' clearing problem is about clearing all the debts in 
a group of n entities (e.g. persons, companies) using a minimal number 
of money transaction operations. In our previous works we studied the 
problem, gave a dynamic programming solution solving it and proved 
that it is NP-hard. In this paper we adapt the problem to dynamic graphs 
and give a data structure to solve it. Based on this data structure we 
develop a new algorithm, that improves our previous one for the static 
version of the problem. 

1 Introduction 

In [2] we studied the debts' clearing problem, and gave a dynamic programming 
solution using @{2^) memory and running in time proportional to 3^. The 
problem statement is the following: 

Let us consider a number ofn entities (e.g. persons, companies), and a list 
of m borrowings among these entities. A borrowing can be described by three 
parameters: the index of the borrower entity, the index of the lender entity and 
the amount of money that was lent. The task is to find a minimal list of money 
transactions that clears the debts formed among these n entities as a result of 
the ra borrowings made. 
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Example 1 



Solution: 



Borrower 


Lender 


Amount of money 


1 




2 


10 


2 




3 


5 


3 




1 


5 


1 




4 


5 


4 




5 


10 


Sender 


Reciever 


Amount of money 


1 




5 


10 


4 




2 


5 



In [2] we modeled this problem using graph theory: 

Definition 2 Let G(V, A,W) he a directed, weighted multigraph without loops, 
|V| = n, |A| = m, W : A — ) Z, where V is the set of vertices, A is the set of 
arcs and W is the weight function. G represents the borrowings made, so we 
will call it the borrowing graph. 

Example 3 The borrowing graph corresponding to Example 1 is shown in 
Figure 1. 




Figure 1: The borrowing graph associated with the given example. An arc 
from node i to node j with weight w means, that entity i must pay w amount 
of money to entity j . 

Definition 4 Let us define for each vertex v G V the absolute amount of 
debt over the graph G: Dg(v) = Y. W(v,v') - Y. W(v",v) 

v' e V v" 6 V 
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Definition 5 Let G'(V, A',W'] be a directed, weighted multigraph without 
loops, with each arc (i, j) representing a transaction o/W'(l, j] amount of 
money from entity i to entity ] . We will call this graph a transaction graph. 
These transactions clear the debts formed by the borrowings modeled by graph 
G(V, A, W) if and only if: 

DG(vi) = DG'(vi),Vi = l,rL, where V = {vi,V2, . . . ,Vn}. 

We will note this by: G ^ G' . 

Example 6 See Figure 2 for a transaction graph with minimal number of arcs 
corresponding to Example 1. 



Figure 2: The respective minimum transaction graph. An arc from node i to 
node j with weight w means, that entity i pays w amount of money to entity 
j- 

Using the terms defined above, the debt's clearing problem can be reformu- 
lated as follows: 

Given a borrowing graph G(V, A,W) we are looking for a minimal tran- 
saction graph Grain Amin, VVVnln); SO that G ~ Grain and VG'(V,A',W') : 
G ~ GMArninl < lA'i holds. 

2 The debts' clearing problem in dynamic graphs 

Definition 7 A dynamic graph is a graph, that changes in time, by under- 
going a sequence of updates. An update is an operation, that inserts or deletes 
edges or nodes of the graph, or changes attributes associated to edges or nodes. 
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In a typical dynamic graph problem one would like to answer queries re- 
garding the state of the graph in the current time moment. A good dynamic 
graph algorithm will update the solution efficiently, instead of recomputing it 
from scratch after each update, using the corresponding static algorithm [1]. 

In the dynamic debts' clearing problem we want to support the following 
operations: 

• InsertNode(u) - adds a new node u to the borrowing graph. 

• RemoveNode(u) - removes node u from the borrowing graph. In order 
for a node to be removed, all of its debts must be cleared first. In order to 
affect the other nodes as little as possible, the debts of u will be cleared 
in a way that affects the least number of nodes, without compromising 
the optimal solution for the whole graph. 

• InsertArc(u, v,x) - insert an arc in the borrowing graph. That is, u 
must pay x amount of money to v. 

• RemoveArc(u, v) - removes the debt between u and v. 

• Query() ~ returns a minimal transaction graph. 

Example 8 For instance calling the Query operation after adding the third 
arc in the borrowing graph corresponding to Example 1 would result in the 
minimal transaction graph from Figure 3. 

These operations could be useful in the implementation of an application 
that facilitates borrowing operations among entities, such as BillMonk [5] or 
Expensure [6]. When a new user registers to the system, it is equivalent with 
an InsertNode operation, and when a user wants to leave the system it is the 
same as a RemoveNode. When a borrowing is made, it can be implemented 
by a simple call of Insert Arc. Two persons may decide, that they no longer 
owe each other anything. In this case RemoveArc can be useful. If the whole 
group decides, that it is time to settle all the debts, the Query operation will 
be used. 

3 A data structure for solving dynamic debts' clear- 
ing 

As the static version of the problem is NP-hard [3] , it is not possible to support 
all these operations in polynomial time (unless P = NP). Otherwise we could 
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Figure 3: Result of the Query operation called after the third arc was added 

just build up the whole graph one arc at a time, by m calls of InsertArc, 
then construct a minimal transaction graph by a call of Query, which would 
lead to a polynomial algorithm for the static problem. 

Our data structure used to support these operations is based on main- 
taining the subset of nodes, that have non-zero absolute amount of debt 
V* = {u|D(u) ^ 0}. The sum of D values for all the 2l^*l subsets of V* is 
also stored in a hash table called sums. 

3.1 InsertNode 

As for our data structure only nodes having non-zero D values are important, 
and a new node will always start with no debts, it means that nothing has to 
be done when calling InsertNode. 

3.2 InsertArc 

When InsertArc is called, the D values of the two nodes change, so V* can 
also change. When a node leaves V*, we do not care about the updating the 
sum of the subsets it is contained in, because when a new node enters V* we 
will have to calculate the sum of all of the subsets it is contained in anyway. 

If both u and v were in V* and remained in it after changing the D values, 
then we simply add x to the sum of all subsets containing u, but not v, 
and subtract x from those containing v but not u. The sum of the subsets 
containing both nodes does not change. 
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If one of the nodes was just added to V* (D[u] = x, or D[v] = — x), then 
all the sums of the subsets containing it must be recalculated. This recalcu- 
lation can be done in 0(1) for each subset, taking advantage of sums already 
calculated for smaller subsets. 
Procedure 1: UpdateSums(u, v, x) 

// Updates the sum of all subsets containing u but not v 

1 foreach S C V* , such that u G S and v S do 

2 if D[u] = X then sums[S] := sums[S \{u}] + x; 

3 else sums[S] := sums[S] +x; 



Algorithm 2: InsertArc(u, v, x) 



1 if D[u] = then V* := V* U {u}; 

2 if D[v] = then V* := V* U {v}; 

3 D[u] := D[u] +x; D[v] := D[v] -x; 

4 if D[u] = then V* := V* \ {u}; 

5 if D[v] = then V* := V* \ {v}; 

6 if D[u] 7^ then UpdateSums (u, v,x); 

7 if D[v] 7^ then UpdateSums (v,u, — x) 

8 if D[u] = X or D[v] = — x then 



9 
10 



foreach S C V* , such that u, v G S do 
1^ sums[S] := sums[S \ {u, v}] + D[u] + D[v] 



One call of UpdateSums iterates over 2'^*'^^ subsets, thus lines 6 and 7 of 
Insert Arc together take 2'^*'^^ steps. Additionally line 9 takes 2'^*'^^ more 
steps. 



3.3 Query 

To carry out Query we observe, that finding a minimal transaction graph 
is equivalent to partitioning V* in a maximal number of disjoint zero-sum 
subsets, more formally V* = Pi U . . . U Pmaxi sums[Pt] = 0, Vi = 1 , max and 
Pi n Pj = 0,VI, j = l,max, I 7^ j. The reason for this is, that all the debts in a 
zero-sum subset Pi, can be cleared by |Pi| — 1 transactions (see [2, 3, 4]), thus 
to clear all the debts, |V*| — max transactions are necessary. Let S'^ be the set 
of all subsets of V*, having zero sum: S'^ = {S|S C V*, sums[S] = 0}. Then, to 
find the maximal partition, we will use dynamic programming. 

Let dp[S] be the maximal number of zero-sum sets, S C V* can be parti- 
tioned in. 
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{not defined, if suTrLs[S] / 

0, if S = 

max{dp[S \ S'] + 1 |S' C S, S' G S°}, otlierwise. 

Building dp takes at most 2'^*' • |S'^| steps. 

As the speed at which Query can be carried out depends greatly on the 
size of S^, we can use two heuristics to reduces its size, without compromising 
the optimal solution. To facilitate the running time of these heuristics, S'^ can 
be implemented as a linked list. 



Clear pairs Choosing sets containing exactly two elements in the partition 
will never lead to a suboptimal solution, if the remaining elements are parti- 
tioned correctly [4]. Thus, before building dp, sets having two elements can be 
removed from S'^, along with all the sets, that contain those two elements (be- 
cause we already added them to the solution, so there is no need to consider sets 
that contain them in the dynamic programming): S*^ := S'^\ ({u,v}U{S'|u G S' 
or V G S'}). 



Procedure 3: ClearPairs 



1 max := 0; 

2 inPair := 0; 

3 foreach S G S*^ do 



if !S| = 2 then 

if S n inPair = then 
max := max + 1 ; 

Pmax ■ — S, 

inPair := inPair U S ; 



9 foreach S G S*^ do 
10 L if |S| n inPair / then S° := 



S°\S; 



The running time of this heuristic is 0(|S |). 

Clear non-atomic sets If a set Si G S*^ is contained in another set Sj G S'^, 
then Sj can be safely discarded, because Sj \ Si will also be part of S'^, and 
combining S^ with Sj \St always leads to a better solution, than using Sj alone: 
S° := S°\{SjPSiGS°:SiCSj}. 

This heuristic can be carried out in 0(|S'^P). 
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Procedure 4: Clear NonAtomic 



1 foreach Si G do 



foreach Sj G S , St / Sj do 
L if SiCSj then S°:=S°\Sj; 



3.4 RemoveNode 

To delete a node u with the conditions listed in the introduction is equivalent 
to finding a set P of minimal cardinality containing u, that can still be part 
of an optimal partition, that is dp[V*] = dp[V* \ P] + 1. This algorithm can 
not be used together with the Clear pairs heuristic, because clearing pairs 
may compromise the optimal removal of u. The running time is the same as 
for Query, because dp must be built. 

3.5 Remove Arc 

Because clearing an arc between two nodes is the same as adding an arc in 
the opposite direction, this can be easily implemented using Insert Arc. If 
the D values of the two nodes have the same sign, it means, that no arc could 
appear in a minimal transaction between the two nodes, so nothing has to be 
done. 

Algorithm 5: RemoveArc(u, v) 

1 if D[u] < and D[v] > then lNSERTARC(u,v,min{— D[u], D[v]}); 

2 else 

3 1^ if D[u] > and D[v] < then Insert Arc (v,u,min{D[u],—D[v]}); 



3.6 Implementation details 

In our implementation we used 32-bit integers to represent subsets. A subset 
of at most 32 nodes can be codified by a 32-bit integer by looking at its binary 
representation: node i is in the subset if and only if the i^^ bit is one. This 
idea allows using bit operations to improve the running time of the program. 

Because we did not use test cases having more than 20 nodes, the hash table 
sums was implemented as a simple array having elements. V* was stored 
as an ordered array, but other representations are also possible, because the 
running time of the operations on V* is dominated by other calculations in 
our algorithm. 
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Before using the methods of the data structme for the first time, the memory 
for its data fields containing V* and sums should be allocated and their values 
initialized, both being empty at the beginning. In a destructor type method 
these memory fields can be deallocated. 

4 A new algorithm for the static problem 

We can observe, that the Query operation needs only the set S'' to be built, 
and in order to build the sum of all subsets of V* needs to be calculated. 
Thus, after processing all the arcs in 0(m] time and finding the D values, we 
build V* in ©{n) time along with the sums hash table, that can be built in 
0(2'^ ') by dynamic programming: 

r 0, if S = 

sums[S = {si, . . . Sk}] = < D[si], if |S| = 1 

[ sums [{s2, . . . S\J\ + D [si ] , otherwise. 

After sums is built, we can construct by simply iterating once again 
over all the subsets of V* and adding zero-sum subsets to S*^. Then we clear 
pairs and non-atomic sets, call Query and we are done. This yields to a total 
complexity of e(m + n + 2l^*l + \S°\^ + 21^*1 • |S°|). 



5 Practical behavior 

As it can be seen from the time complexities of the operations, the behavior of 
the presented algorithms depends on the cardinalities of V* and S*^ and their 
running times may vary from case to case. 

We have made some experiments to compare our new algorithms and the 
static algorithm presented in [2]. We used the same 15 test cases which were 
used, when the problem was proposed in 2008 at the qualification contest of 
the Romanian national team. Figure 4 contains the structure of the graphs 
used for each test case. 

In our first experiment we compared three algorithms: the old static al- 
gorithm based on dynamic programming from [2], our new static algorithm 
described in Section 4 and the dynamic graph algorithm based on the data 
structure presented in Section 3. For the third algorithm we called Insert Arc 
for each arc, then Query once in the end, after all arcs were added. 

We executed each algorithm three times for each test case, and computed the 
average of the running times. The new static algorithm was the fastest in nine 
test cases, while the old static algorithm was the fastest in the remaining six 
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6 


20 


10 


10 


Yields to D[i] = 99, Vi = 1, 10, D[i] = — 99,Vi = 
1 1 , 20, maximizing the number of pairs 


7 


20 


19 


12 


A path with random weights having close values 
(50± 10) 


8 


20 


20 


10 


A cycle with random weights having close values 
(50± 10) 
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10 


100 
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Random graph with weights < 10 


10 


12 


100 
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Random graph with weights < 10 


11 
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100 


11 


Random graph with weights < 1 
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Random graph with weights < 10 
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A path with consecutive weights 
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Ten pairs, a path, a star and triples put together 
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20 
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15 


Dense graph with weights < 3 



Figure 4: The structure of the test cases 



test cases. Looking at the average running time over all the test cases, the new 
static algorithm was clearly the fastest with an average of 0.08 seconds. The old 
static algorithm came second with 0.64 seconds, and the dynamic algorithm 
third with 1.22 seconds. The difference between the last two is surprisingly 
small, taking into account that the dynamic algorithm may perform 27^ steps 
after each arc insertion. Running times are shown in Figure 5. 

In the second experiment we used the same methodology to compare our 
new dynamic algorithm and the static algorithm presented in [2] . For the first 
algorithm the solution was recomputed from scratch each time an arc was read 
from the input file, and for the second after each InsertArc a Query was also 
executed. The dynamic algorithm was faster for eight test cases, recalculating 
from scratch was faster in the other seven cases. The average running time 
over all test cases is 23.6 seconds for the first algorithm and 41.9 seconds for 
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91.3% 
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-909.7% 
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0.169 


-145.2% 
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0.084 


-26.6% 
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45.5% 


10 


0.014 


0.007 


0.022 


47.6% 


11 


0.015 


0.008 


0.106 


44.4% 


12 


0.016 


0.056 


5.526 


-242.8% 


13 


0.765 


0.079 


0.465 


89.6% 


14 


0.013 


0.048 


1.527 


-271.7% 


15 


2.274 


0.218 


8.553 


90.3% 



Figure 5: Average rmming times for the first experiment, all given in seconds. 
The best running times are bolded for each test. The last column shows the 
improvement of the new static algorithm over the old one in percentage. A 
negative value means, that no improvement was done. 

the dynamic algorithm, mostly due to the last test case which runs for a long 
time compared to the others. Without taking into account the last test case 
the average running times are 1.05 and 1.28 seconds respectively. 

By comparing the last two columns of the table depicted in Figure 6, one can 
see how powerful our heuristics are, reducing the cardinality of S'^ by several 
magnitudes in many cases. We can observe, that the dynamic algorithm usually 
performs slower than recomputing from scratch, when the size of S*^ before 
applying the heuristics is quite large, at least several hundreds. The reason 
behind this is probably the quadratic complexity of clearing non-atomic sets. 

6 Conclusions 

In this paper we introduced a new data structure capable of supporting arc 
insertions and deletions, node insertions and deletions in a dynamic borrow- 
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Figure 6: Average running times for the second experiment, aU given in sec- 
onds. The best running times are bolded for each test. The third column shows 
the improvement of the dynamic algorithm over recomputing from scratch with 
the old static one in percentage. A negative value means, that no improvement 
was done. The last three columns show the average cardinality of V*, S° and 
S" after applying both heuristics respectively. 

ing graph, along with finding the minimal transaction graph. Using this data 
structure we developed a new static algorithm, which is faster than the one 
described in [2] in many cases and in average. 

We find the running times of the dynamic algorithm and recomputing from 
scratch with the old static algorithm to be comparable on average. With a 
good heuristic, that runs in reasonable time, but still reduces the size of S*^ 
significantly, a better performance could be possible for the dynamic algorithm. 
Finding such a heuristic remains an open problem. 

Our experiments are not meant to be an exact comparison among the al- 
gorithms, as the running time can greatly depend on the details of the imple- 
mentation. Their purpose was just to get a general overview on the behavior 
of the various algorithms for different kind of graphs. 
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